Semantic Indexing of Technical Documentation
نویسندگان
چکیده
This research takes place in an industrial context: the CONTINEW Company. This company ensures the storage and security of critical data and technical documentation. Consequently, it is necessary to organize these documents in order to retrieve quickly critical information. The management of this increasing volume of documents requires document classification which is based on indexing techniques. So, how much relevant the indexing phase is, more relevant the classification will be. The technical documentation is by nature strongly structured. For example, the logical structure describes the role and the nature of the document elements (introduction, title, section, and so one...) and the logical links between them (A chapter is composed of section and so one). Such structure facilitates document presentation and improves the indexing precision. The classical information retrieval systems use neither the logical structure, nor the concept contained in the textual content of documents. The document semantic is described by concepts belonging to a semantic resource. In this context, we propose a new semantic indexing model which exploits both the logical structures and the semantic contents of documents.
منابع مشابه
Building and Managing E-book Collections: A How-to-do-it Manual for Librarians
This book explores the analysis and interpretation, discovery and retrieval of a variety of non-textual objects, including image, music and moving image. Bringing together chapters written by leading experts in the field, this book provides an overview of the theoretical and academic aspects of digital cultural documentation and the state of the art. Case studies of digitization projects drawn ...
متن کاملOn the Semantification of 5-Star Technical Documentation
Technical documentation is a special purpose content describing machines and plants with high complexity. The documentation covers operation, maintenance and repair of the technical artifacts. The high complexity of the machines yields a voluminous documentation, where it increasingly becomes difficult to find the relevant information for a given problem. The paper discusses the use of semantic...
متن کاملImplementing CIDOC CRM Search Based on Fundamental Relations and OWLIM Rules
The CIDOC CRM provides an ontology for describing entities, properties and relationships appearing in cultural heritage (CH) documentation, history and archeology. CRM promotes shared understanding by providing an extensible semantic framework that any CH information can be mapped to. CRM data is usually represented in semantic web format (RDF) and comprises complex graphs of nodes and properti...
متن کاملSemantic Web Technologies in Technical Automotive Documentation
RDF is the format of choice to exchange data between software components of a corporate system. That’s why we decided to use it in a recent work at Renault, in the field of technical documentation. The prototype of a new repository for repair and diagnostic information was modeled with OWL. REST web services using RDF as data format were built on this repository, to provide access to improved r...
متن کاملExploiting semantic knowledge in LTAG-based controlled indexing of technical data
The work presented in this abstract follows the first experiments presented in (Lopez and Roussel, 2000) on the robust modeling of terms in the LTAG framework to index spoken annotation transcriptions. We continue to experiment the LTAG workbench (Lopez, 2000), and integrate it with on the shelftools (tenn extractor, taggers, terminological model) that embed and manage different kind of linguis...
متن کامل